Video signal processing

ABSTRACT

A method of de-interlacing a video signal by conducting three linear filtering operations on the input to produce three filtered signals, each linear filtering operation comprising the taking of a weighted sum of pixels; and multiplying together the three filtered signals to produce an output video signal.

This invention relates to video signal processing and is particularly concerned with non-linear filtering.

It has been found that in a wide variety of video signal processes—including de-interlacing, decoding, enhancement, noise reduction, and standards conversion—considerable advantage can be secured by the use of complex non-linear filters. It has been found in particular that polynomial filters can be very useful. In many applications, quadratic behaviour in the filter is not sufficient and third or higher orders are typically necessary. Where real time operation is required, hardware implementations are usually essential and the hardware costs of such high order polynomial filters are substantial.

It is an object of the present invention to provide improved methods and apparatus in video signal processing which offer third or higher order behaviour in a relatively simple filter architecture.

Accordingly, the present invention consists, in one aspect, in a method of video signal processing, comprising the steps of conducting three linear filtering operations on an input video signal to produce three filtered signals, each linear filtering operation comprising the taking of a weighted sum of pixels; and multiplying together said three filtered signals to produce an output video signal.

Suitably, the weighted sum is taken over pixels of the input video signal defined by a filter aperture and, preferably, all three linear filtering operations have the same filter aperture.

In one embodiment, for at least one linear filtering operation, the taking of a weighted sum of pixels includes the output pixel of the respective linear filtering operation.

In another aspect, the present invention consists in apparatus for video signal processing comprising an input terminal for receiving an input video signal; first, second and third linear filters each connected with the input terminal and arranged to provide an output through taking a weighted sum of pixels; a first multiplier for multiplying together the respective outputs of the first and second filters; and a second multiplier for multiplying together the respective outputs of the first multiplier and the third filter to produce an output video signal.

Advantageously, a filter is interposed between the output of the first multiplier and the second multiplier.

Preferably, the apparatus further comprises a linear filter path connected with the input terminal, and a combiner for combining the outputs of the linear filter path with the output of said second multiplier.

Suitably, a filter is interposed between the output of the second multiplier and said combiner.

The invention will now be described by way of example with reference to the accompanying drawings in which:

FIG. 1 is a block diagram of video signal processing apparatus according to the invention, in the form of a vertical de-interlacing filter; and

FIG. 2 is a diagram similar to FIG. 1, illustrating a modification.

The example will be taken of a de-interlacer and, for reasons of clarity, a de-interlacer will be described that utilises only vertical information. It will be understood that horizontal and temporal information could be included in ways which will immediately evident to the skilled reader.

In FIG. 1, the new architecture can be seen to consist of two signal paths. A linear signal path 10 contains a traditional, vertical, six tap, linear filter (h_(lin)) which has a typical sin(x)/x structure. If this were to be used without the non-linear signal path it would produce reasonable pictures, but they would contain some artefacts due to the interpolation process, notably jagging on diagonal and curved edges.

In the non-linear signal path 20, the output of two four point linear filters (h₁ and h₂) are multiplied together and passed through a two point linear filter (h₄). The output of this is then multiplied with the output of a five point linear filter (h₃). The resulting signal is filtered through another two point linear filter (h₅) before being added on to the linear path. Although in this case the filter lengths are 4, 5 and 2, larger filters with more taps can be used to give better results. The lengths (or more generally, the sizes) of the filters need not be related and can be made larger or smaller to provide different trade-offs between quality and cost.

It will be recognised that the arrangement of FIG. 1 serves to generate the “missing” lines in a de-interlacing operation. These new lines will be combined in a multiplexer with the (suitably delayed) “original” lines.

The filter coefficients can be selected by ‘training’ the filter on real pictures. In this example of de-interlacing, a still frame is taken and split into fields. A set of coefficients is used to estimate Field 2 from Field 1 and the mean squared error between the estimate of Field 2 and the original Field 2 is measured. A genetic algorithm can then be used to search the multi-dimensional filter space for the set of filter coefficients that gives the lowest mean squared error.

If the described non-linear de-interlacer is tested on the EBU/SMPTE test picture “Girl with Toys”, the non-linear path is found to reduce the average mean squared error by approximately 15% with respect to the linear filter. There is also a noticeable reduction in jagging.

A polynomial filter with the same number of input pixel taps produces an almost equivalent reduction in error. However, a major advantage of this new architecture over the polynomial filter can be seen by considering the number of multiplications of pixels; multiplications of pixels by a constant; and additions, that each filter requires. These are shown in Table 1.

TABLE 1 Comparison of complexity of filters Polynomial New Filter Architecture Multiplication 50  2 Multiplication by 34 23 a constant Additions 34 24

It can be seen that the largest reduction is in the multiplication of pixels. This is particularly significant as these are the most expensive to implement.

In summary, the new architecture is able to reduce many of the artefacts associated with traditional linear interpolation whilst being relatively simple to implement.

FIG. 2 illustrates a modification in which the architecture is simplified through omission of the filters h₄ and h₅. In other words, the direct product is formed of the outputs of filters h₁, h₂ and h₃ without intervening filtering of the product of the outputs of filters h₁ and h₂. This may under some circumstances produce less ideal filter behaviour, but the reduction in hardware complexity will often more than compensate. A particular advantage is that the three remaining filters can all make use of the same memory architecture.

It should be understood that this invention has been described by way of example only and that a wide variety of modifications are possible without departing from the scope of the invention. Thus, whilst the separation into linear and non-linear paths offers important advantages, such as the option to preserve higher bit accuracy in the linear path, it will not always appropriate. Similarly, the described use of vertical filters is—as has been explained—merely an example. Horizontal, vertical and temporal filters can be employed and filters can have one, two or three of these dimensions. Whilst Finite Impulse Response (FIR) filters will be important, the invention also encompasses other forms of linear filter such as recursive filters which include The output pixel in the weighted sum. The filters which are to be multiplied together need not be of the same category. However, providing three FIR or transversal filters with the same filter aperture ensures that in the multiplication of the three filtered signals, all possible cross products of input pixels are made available.

It will be recognised that although de-interlacing has been chosen as an example, filters according to the present invention can be applied to other problems in video processing, including composite to component decoding, enhancement, noise reduction, up and down conversion and standards conversion. 

1. A method of video signal processing, comprising the steps of conducting three linear filtering operations on an input video signal to produce three filtered signals, a first signal, a second signal, and a third signal, each linear filtering operation comprising the taking of a weighted sum of pixels; and multiplying the first signal and the second signal to form the product of the first signal and the second signal and multiplying the third signal and the product of the first signal and the second signal to produce an output video signal being the product of the first signal, the second signal and the third signal.
 2. A method according to claim 1, wherein said weighted sum is taken over pixels for the input video signal defined by a filter aperture.
 3. A method according to claim 2, wherein all three linear filtering operations have the same filter aperture.
 4. A method according to claim 1, wherein, for at least one linear filtering operation, the taking of a weighted sum of pixels includes the output pixel of the respective linear filtering operation.
 5. A method according to claim 1, wherein said three filtered signals are multiplied together without intervening filtering of the three filtered signals.
 6. Apparatus according to claim 5, wherein a filter is interposed between the output of the second multiplier and said combiner.
 7. A method according to claim 1, wherein a further linear filtering operation is conducted in parallel on the input video signal, with the result of said further linear filtering operation being combined with the multiplication product of said three filtered signals to produce an output video signal.
 8. A method of video signal processing, comprising the steps of conducting three linear filtering operations on an input video signal to produce three filtered signals, each linear filtering operation comprising the taking of a weighted sum of pixels; and multiplying together said three filtered signals to produce an output video signal, wherein the product of two of said filtered signals is formed and a linear filtering operation conducted on that product, prior to multiplication of said product by the third filtered signal.
 9. Apparatus for video signal processing comprising an input terminal for receiving an input video signal; first, second and third linear filters each connected with the input terminal and arranged to provide an output through taking a weighted sum of pixels; a first multiplier for multiplying together the respective outputs of the first and second filters; and a second multiplier for multiplying together the respective outputs of the first multiplier and the third filter to produce an output video signal.
 10. Apparatus according to claim 9, wherein said weighted sum is taken over pixels of the input video signal defined by a filter aperture.
 11. Apparatus according to claim 10, wherein said three linear filters have the same filter aperture.
 12. Apparatus according to claim 9, wherein at least one linear filter is arranged to take a weighted sum of pixels which includes the output pixel of the respective linear filter.
 13. Apparatus according to claim 9, wherein there is further provided a linear filter connected between the output of said first multiplier and the input to said second multiplier.
 14. Apparatus according to claim 9, wherein there is a direct connection between the output of said first multiplier and the input to said second multiplier.
 15. Apparatus according to claim 9, wherein the apparatus further comprises a linear filter path connected with the input terminal, and a combiner for combining the outputs of the linear filter path with the output of said second multiplier. 